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We present a modelling framework, and basic model parameterization, for the study of DNA 
origami folding at the level of DNA domains. Our approach is explicitly kinetic and does not assume 
a specific folding pathway. The binding of each staple is associated with a free-energy change that 
depends on staple sequence, the possibility of coaxial stacking with neighbouring domains, and the 
entropic cost of constraining the scaffold by inserting staple crossovers. A rigorous thermodynamic 
model is difficult to implement as a result of the complex, multiply connected geometry of the 
scaffold: we present a solution to this problem for planar origami. Coaxial stacking of helices 
and entropic terms, particnlarly when loop closnre exponents are taken to be larger than those 
for ideal chains, introduce interactions between staples. These cooperative interactions lead to the 
prediction of sharp assembly transitions with notable hysteresis that are consistent with experimental 
observations. We show that the model reproduces the experimentally observed consequences of 
reducing staple concentration, accelerated cooling and absent staples. We also present a simpler 
methodology that gives consistent results and can be nsed to study a wider range of systems including 
non-planar origami. 


I. INTRODUCTION 

Recently, the exquisite specificity of Watson-Crick base 
pairing has been harnessed to create artificial nanoscale 
structures from DNA [iHin]. One of the most popular 
techniques for assembling DNA nanostructures is known 
as DNA origami. Pioneered by Rothemund [3], this ap¬ 
proach involves the folding of a long “scaffold” strand of 
DNA (often the genome of the M13 bacteriophage, ap¬ 
proximately 7300 bases in length) by hybridization with a 
set of much shorter “staple” strands. Most staples are de¬ 
signed to be complementary to two or more distinct parts 
of the scaffold; hybridization to staples causes the scaf¬ 
fold to fold into a target shape. DNA origami has been 
used to construct a wide variety of 2D and 3D structures 
[3 ME [ini- Origami nanostructures can be function¬ 
alized with sub-nanometre precision, allowing them to 
function as molecular breadboards for the construction of 
optical devices m and light-harvesting complexes 
tracks for decision-making robots [mn], and scaffolds 
for studying enzymatic cascades m- Origami struc¬ 
tures can act as ‘frames’, ‘rulers’ and ‘handles’ for single¬ 
molecule manipulation minj or harnesses for connect¬ 
ing multiple motor proteins [TS] . Other suggested appli¬ 
cations include drug delivery [laiin]- 

Although DNA origami has been remarkably success¬ 
ful, open questions about the assembly process remain. 
Folding yield is variable, and even when a structure is 
largely folded missing staples could affect its mechani¬ 
cal properties m and addressability. An understand¬ 
ing of general principles for optimizing yield and folding 
rate would allow the improvement of current designs and 
the assembly of larger and more complex structures. A 


number of recent experiments have probed the details of 
origami folding . Assembly is cooperative [231 US] j 

occurs across a narrow temperature window [23] , exhibits 
hysteresis |23l |26l El] and is highly sensitive to staple de¬ 
sign |23H23]. 

Current theoretical modelling is very limited. Al¬ 
though the thermodynamics of isolated duplex formation 
are well understood and quantified [29] , it is not possible 
to model origami formation as a sequence of independent 
staple-binding events without losing much of the under¬ 
lying physics. In particular, such an approach cannot de¬ 
scribe cooperative interactions whereby the presence of 
a bound staple affects the binding of other staples to the 
scaffold. Recent evidence suggests that cooperative in¬ 
teractions affect origami folding significantly 1231 Ea Eli, 
leading to sharp formation transitions as the tempera¬ 
ture is lowered and contributing to hysteresis in heating 
and cooling experiments. 

The physical causes of cooperativity during origami 
folding are subtle. One might hope to draw analogies 
with the nucleation of crystals: until a crystal exceeds 
a threshold size, the lower free energy of the crystalline 
phase is offset by a substantial interfacial cost. Theoreti¬ 
cal work has suggested that nucleated growth does occur 
in another class of DNA nanostructures that assemble 
from many short strands (“bricks”) without a guiding 
scaffold m- In this case, cooperative, nucleated assem¬ 
bly is expected: a high proportion of the bricks of a pre¬ 
nucleated structure are only bound to one other brick, 
whereas bricks binding to a larger structure can form two 
or more bonds. For DNA origami, however, the analogy 
with crystal nucleation is not so obvious. Staples bind to 
the scaffold only, not to other staples, and can hybridize 
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to the same number of scaffold domains at any stage of 
assembly. Arbona et al. [22l [26] have presented a model 
of origami folding in which cooperativity arises from two 
effects. 

• A bound staple generally holds two or more distinct 
parts of the longer scaffold strand in close proxim¬ 
ity, forming scaffold loops and incurring an entropic 
penalty. Arbona et al. postulate that the binding 
of one staple to the origami might facilitate the 
binding of a second by shortening the loop which 
it must enclose pS] . 

• Arbona et al. include a phenomenological term 
through which the density of nearby staples is taken 
to stabilize the binding of an additional staple to 
the scaffold; they suggest that attractive interac¬ 
tions between strands mediated by divalent cations 
might be responsible for such an interaction [26j . 

Although an important contribution, there are some 
drawbacks to their approach. For the sake of tractability, 
staple binding is assumed to follow a single, well-defined 
pathway. This assumption means that the actual staple 
binding probabilities in the steady state are not consis¬ 
tent with the free energies used as inputs to the model; 
the consequences of this discrepancy are not explored. 
There is no timescale in the model, so it is unclear how 
properties such as apparent hysteresis should be inter¬ 
preted. 

In this work we demonstrate a general ‘global’ model 
for the folding of any origami that has a structure that 
can be represented as a planar graph. The global model 
is thermodynamically self-consistent, has explicit kinet¬ 
ics and allows for the analysis of competing folding path¬ 
ways. The model naturally generates cooperative effects 
in staple binding: cooperativity arises as a result of the 
influence of bound staples on the proximity of the bind¬ 
ing domains of other staples and from coaxial stacking 
of duplexes. We explore changes in origami folding as 
the strength of the cooperativity factors are varied, and 
demonstrate that the model is consistent with experi¬ 
mental observations of the consequences of reducing sta¬ 
ple concentration, accelerated cooling and absent staples. 
We also present a simplihed ‘local’ model which can be 
extended to non-planar graphs. The local approach is not 
a true thermodynamic model, but it does give similar re¬ 
sults to the global model and deviations are systematic 
and explicable. 

We recently used the local model to predict folding 
trajectories in an unusual origami with a scaffold com¬ 
posed of two identical halves [3T]. In this system, two of 
each staple type can bind to a single scaffold in two pos¬ 
sible configurations, leading to a vast number of possible 
hybridized structures among which are a number of ap¬ 
proximately degenerate but geometrically distinct well- 
formed assembly products. The distribution between 
well-formed products is strongly dependent on cooper¬ 
ative effects in staple binding, and the model is remark- 



FIG. 1. A simple origami design studied here. Two-domain 
staples are shown in blue and black, and single-domain staples 
in grey. The scaffold, shown as a thin black line, is a long 
loop of DNA that hybridizes to each domain of each staple. 
A specific implementation of this design is described in Ref. 

m- 



FIG. 2. Transitions in the model. Four possible bonding 
configurations of two domains of a scaffold which can bind to 
staple i are shown. From left to right these configurations are 
denoted Pi = 00, pi = 01, 10 (half-bound) and pi = 11 (fully 
bound). A fifth state with two half-bound staples {pi = 12) 
is not shown. 


ably successful in predicting the effects of rationally en¬ 
gineered changes in staple-scaffold interactions. 


II. MODEL AND METHODS 
A. Model state space 

We model the folding of an isolated scaffold surrounded 
by a large excess of staples and describe the folding of this 
origami at the level of domains. Such an abstraction is 
common in models of complex nucleic acid nanostruc¬ 
tures and strand displacement systems [551130113111331 ■ 
A domain is a contiguous series of bases, typically 5 to 
20 nucleotides long, that is designed through its base se¬ 
quence to bind selectively to a complementary domain as 
a cooperative unit. Origami scaffolds typically have a few 
hundred domains whereas staples typically have between 
one and hve. 

For simplicity, we will restrict examples in this work 
to staples with at most two domains (Fig. [^, although 
our approach can be generalized to staples with more 
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domains (see Appendix 0- We describe a two-domain 
staple as half-bound if only one domain is hybridized to 
the scaffold and fully bound if both domains are bound 
(Fig. We assume that only fully complementary do¬ 
mains can hybridize, ignoring weaker interactions that 
result from inevitable partial sequence complementarity 
between other pairs of domains. These interactions could 
be included, but at a high price, as this would signifi¬ 
cantly expand the state space of the model. We note 
that it would be an even harder problem to consider all 
possible interactions that are not consistent with the ab¬ 
straction to domains, including hybridization of a staple 
to more than one scaffold domain and the formation of 
secondary structure within the scaffold itself. 

Let the design consist of k staples and let pi denote 
the bonding configuration of the scaffold domains which 
interact with the i-th staple. For single-domain staples 
we dehne pi G {0,1} where 

• 0: a staple is not bound to the scaffold domain; 

• 1: a staple is bound to the scaffold domain. 

For two-domain staples we have Pi G {00,10,01,11,12} 
where: 

• 00: no staple bound to either scaffold domain; 

• 10: a single staple is bound to the first domain, the 
second domain is empty; 

• 01: a single staple is bound to the second domain, 
the first domain is empty; 

• 11: a single staple is bound to both domains; 


the assumption of mass-action kinetics, the concentration 
[AB] is described by 

'^=k+[A][B]-k.[AB] (1) 

for rate constants k+^k-. The equilibrium concentra¬ 
tions {A}, {B} and {AB} then follow 


im 

{A}{B} 


k_ 


= exp 


A ^0 duplex ^ 

RT 


X M 


-1 


( 2 ) 


where R denotes the molar gas constant and T tem¬ 
perature. is the 

sequence-specific Gibbs free energy change for duplex for¬ 
mation at molar concentration (1M), and AiL^g“P^®^ and 
are the corresponding changes in enthalpy and 
entropy. Throughout this work, superscript 0 will re¬ 
fer to quantities defined with respect to this standard 
concentration (the factor of arises from dimen¬ 

sional considerations). For a given duplex, these quan¬ 
tities can be estimated through the widely-used nearest- 
neighbour model of SantaLucia [5S] in which 
and are assumed to be T-independent. 

To use a similar description for origami staple binding, 
we need to establish an expression for AG° g/, the differ¬ 
ence between the free energies of partially-folded inter¬ 
mediate states s' and s. A first approximation to AGg g, 

might be AG® which takes into account only the 

free energies of hybridized domains calculated as if each 
hybridized duplex were formed in isolation. However, 
this ignores significant interactions between different sec¬ 
tions of the origami which depend on the state of the 
folding. Instead, we take 


• 12: a distinct staple is bound to each domain. 


AG° g, = AG° + AGfg'P*^ + AGf (3) 


The state of the scaffold is given by the bonding con¬ 
figuration of the domains. Denote s = (po,Pi;... jPfe-i) 
and let the set of states be S. Then the size of the state 
space is [S'! = 2^ x 5* where j is the number of single¬ 
domain staples and I the number of two-domain staples. 
This description can be extended to staples with more 
than two domains, although we will focus on the simple 
case here. Typically, each scaffold domain has a unique 
base sequence, and this restriction is implicit in the state 
space presented here. In Ref. [3T], we show how the 
state space of the model can be extended to handle scaf¬ 
folds with repeated sections that allow multiple bonding 
configurations for each staple. 


B. Kinetic descriptions 


where AGg^gT** represents the contributions from the en- 
tropic costs of scaffold loop formation Ennaiszi, and 
AG|*}f^ is a contribution from the coaxial stacking of du¬ 
plex sections [55MT] . We discuss our approxi mation s for 
the various contributions to AG° in Section 


lie 


The value of AG® g/ associated with hybridization of 
a domain determines the ratio between binding and un¬ 
binding transition rate constants, but does not determine 
their absolute values. Given AGgg,, we calculate the 
transition rate for the binding of the first arm of a staple 
as follows. Gonsider an isolated origami in a partially 
folded state sqo with pi = 00 for a particular staple type 
i, and let a staple i bind to the scaffold by a single do¬ 
main, resulting in state sqi with pi = 01 (Fig. [^. We 
take this transition rate to be equal to that for isolated 
duplex formation 


Simple kinetic models for bimolecular reactions of com¬ 
plementary DNA strands are well established in the liter¬ 
ature [331133] . Let complementary strands A, B bind re¬ 
versibly, forming the double-helical complex AB. Under 


o-(soo,soi) = fc+[i], (4) 

where we use cr(s, s') to denote the rate with which a 
scaffold makes a transition from state s to s'. Since the 
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rate of formation of an isolated duplex is known to be 
more weakly dependent on duplex stability than the cor¬ 
responding unbinding rate [51], we assume that is 
independent of temperature, domain sequence, and fold¬ 
ing state, and we fix k+ = 10® M“^s“^ as a reasonable 
first approximation [Ml [Ml l42] . Throughout this work 
we will assume that the excess staple concentrations are 
sufficiently high that free staple concentrations can be 
taken to be constant. Note that this assumption does 
not mean that staple concentrations are high in abso¬ 
lute terms, only that they are sufficiently in excess of the 
very dilute scaffold strands. The rate (t(soijSoo) for the 
reverse reaction is then given by: 


^(soijSoo) = k+ exp 


RT j 


X M. 


(5) 


1. Duplex free energies 

AG° is calculated using the well-established San- 
taLucia parameterization of the nearest-neighbour model 
of DNA thermodynamics IM] • We assume buffer con¬ 
ditions of 40 mM Tris and 12.5 mM Mg^'® and apply an 
additional buffer-dependent entropic penalty for duplex 
formation [551155115^ : 

A^o.sait ^ Q 3gg X — X In [Tris] -f 3.3[Mg2+]i/^ 

(9) 

where N is the number of phosphates in the duplex (for 
a duplex of length d base pairs, we take N = 2{d — 1)). 


The binding and unbinding transitions of the second do¬ 
main of staple i to form sn with pi = 11 are associated 
with the thermodynamic constraint 


^■(soi, Sll) 
Soi) 


= exp 


RT 


( 6 ) 


To resolve the ambiguity in the absolute values of 
ct(so 1 :Sii) and (t(sii,soi)) we make the assumption 
that the unbinding rate cr(sii,soi) is equal to the 
temperature- and sequence-dependent unbinding rate for 
the corresponding isolated duplex (with a correction for 
coaxial stacking): 


^(siijSoi) = k+ exp 


/ AGW- +AGf-^^^ 
RT 


X M. 

(7) 


AGlIjj^P®^^ represents the free-energy change correspond¬ 
ing to the change in the geometric constraints imposed 
by staple crossovers on the part-folded origami. In our 
model, these constraints are manifest as a modified bind¬ 
ing rate for the second staple domain. Using Eqs. [^and 
[7] we find 


/_y\^shape 

o-(soi,sii) = fc+exp ( - 


X M. 


( 8 ) 


Eqs. Hilzl and imply that unbinding transition rates 
grow rapidly with temperature, whereas binding rates are 
essentially constant. This feature of the model is consis¬ 
tent with a physical picture in which the barrier to du¬ 
plex opening is primarily enthalpic, whereas the barrier 
to duplex formation is primarily entropic. 


C. Free energy model 

In this section we outline the calculation of various 
contributions to AG® . 


2. Coaxial stacking free energies 

It is well known that coaxial stacking of bases across 
a nick in the DNA backbone can stabilize the flanking 
duplexes [55MT] . We add a coaxial stacking contribu¬ 
tion AG®*®'®’'(T) to the free energy of each state wher¬ 
ever two adjacent scaffold domains are both hybridized 
to staples. The change in coaxial stacking free energy 
for a transition between states is AG|*gf’'. Geometrical 
limitations that prevent coaxial stacking, such as bends 
in the scaffold routing, are ignored. We parameterize the 
coaxial stacking strength by AG®*‘‘®‘'(T) = n(AG®P(T)), 
where (AG®p(T)) is the sequence-averaged free-energy 
gain per base pair in the nearest-neighbour model of Ref. 
[55] . AG®®®'®’'(r), which has been considered by SantaLu- 
cia and others [551 - HT] . is much more weakly constrained 
than other aspects of the nearest-neighbour model. We 
thus treat n as an adjustable parameter of the model, 
and explore the consequences of varying n in Section |TTT| 


3. Scaffold shape free energies 

The presence of m fully-bound, two-domain staples 
puts m additional constraints on the conformation of the 
origami, as each crossover between staple domains brings 
non-adjacent scaffold domains into close proximity, pin¬ 
ning part of the scaffold into a loop. The existence of 
m looping constraints due to m staples suggests that we 
can decompose 


Gf ^Pe - AG‘°°p (10) 

L{s) 

where L(s) is a set of m loops present in the partially 
folded structure s, and G^^jP® is the reference conforma¬ 
tional free energy for a scaffold with no staples. Eq. [I^ 
is deceptively simple, but before we propose functional 
forms for AG*°°p we must decide which loops should be 
included in the set L{s). To this end we represent the 
partially folded origami as a graph in which the circular 
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FIG. 3. (a): A scaffold with two fully-bound staples, (b): 

Ambiguity in selecting a minimal set of loops from an em¬ 
bedded graph constructed from (a). To construct an abstract 
graph based on (a), we draw edges that correspond to the 
scaffold strand and the crossovers between staple domains. 
Planar projections of this graph ((i)-(iv)) allow the identifi¬ 
cation of faces that are either internal (green arrows) or ex¬ 
ternal (red arrows) with respect to the scaffold. A staple-free 
scaffold has two faces (one internal and one external); the ad¬ 
dition of each two-domain staple creates an additional face. 
However, as demonstrated by (i)-(iv), multiple planar em¬ 
beddings of the same graph are possible, each giving a set of 
faces that are encircled by distinct loops of DNA. This ambi¬ 
guity can be resolved by specifying a priori which staples are 
external to the scaffold; if black staples are internal and blue 
external, (b.iii) is the appropriate planar projection. 


scaffold is a chain of linked vertices, one for each domain, 
and each fully-bound staple is an edge connecting the cor¬ 
responding vertices. We can identify loops on this graph 
by finding cyclic paths that do not traverse the same edge 
twice. The total number of such loops (or simple cycles) 
that can be drawn, however, grows exponentially with 
the number of bound staples. It is not obvious how one 
might algorithmically choose the set of m loops that gives 
the most physically meaningful representation of 
for state s. 

This combinatorial difficulty seems hard to understand 
when one looks at a schematic representation of a folded 
origami, such as Fig. It seems almost trivial to identify 
a unique set of physically meaningful loops, one for each 
two-domain staple. In the case of flat origami [3], the as¬ 
sociated graph has a natural planar embedding given by 


the two-dimensional shape where the self-evident loops 
are equal to the faces in the embedding. Inspired by 
this observation, we extend the concept of planar embed¬ 
ding to partially-folded states (for a discussion of multi- 
domain staples, see Appendix]^. For a given planar em¬ 
bedding, it is easy to identify faces (and hence loops) 
within an origami, as illustrated in Fig. However, 
Fig. [3] also shows that a given graph has multiple pos¬ 
sible planar embeddings. To resolve this ambiguity, we 
specify whether the edge associated with each staple lies 
on the inside or on the outside of the scaffold on the basis 
of the intended origami structure. In the simple origami 
design shown in Fig. the staples on the outside of the 
scaffold are blue, while the inner staples are black. This 
makes the planar embedding of the graph unambiguous 
at any stage of folding (Fig. [^. We can therefore asso¬ 
ciate the set of loops L{s) appearing in Eq. 10 with the 
set of loops that enclose the faces of the planar graph of s. 
The addition of any two-domain staple to any partially- 
formed origami state increases the number of faces in the 
planar graph by one, and this face can be unambiguously 
identihed. In Section [II C 5| we propose a functional form 
for AG^°°P which depends on the structure of the DNA 
loop encircling the relevant face. Because a graph with 
m two-domain staples has to -I- 2 faces, we necessarily ob¬ 
serve one looping constraint (face) too many, a subtlety 
we discuss in Appendix]^ We note that this formalism 
is not directly related to the terminology of ‘interior’ and 
‘external’ loops used in the secondary-structure predic¬ 
tion community |44] . Each additional loop in our model, 
be it assigned to the inside or the outside of the scaffold 
embedding, represents a physical looping constraint, and 
would be described as an interior loop in Ref. [44] . 

In our ‘global’ approach, we recalculate ex¬ 


actly (within the approximations of the model) at each 
stage. We describe this approach as ‘global’ because we 
explicitly consider the consequences of an incoming (or 
outgoing) staple for all loops, not just the one most ob¬ 
viously associated with the staple in question. If we do 
not, the resultant dynamics would not reach an equilib¬ 
rium state characterized by AG® Asa simpler alterna¬ 
tive, we propose a ‘local’ model for estimating 
in which we include only the cost of the smallest (mini¬ 
mal) loop that forms or is disrupted during a transition 
and neglect the effects of the transition on other loops in 
the graph. In the local model 


AG^^h-Pe 


AG|))°® if a loop forms under s —>■ s', 

—AGJ))°® if a loop breaks under s —)■ s', 

0 otherwise. 

( 11 ) 


A loop forms (breaks) if a single domain of a two-domain 
staple binds (unbinds) while the other domain is attached 
to the origami. We identify the minimal loop as the cy¬ 
cle on the origami graph that has the smallest value of 
Ag'°°p of those containing the edge that corresponds to 
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the staple crossover in question. To perform this calcula¬ 
tion, we use the graph of the state in which the domain 
in question is bound (for maximal consistency with the 
global model). 


4- Illustrating the global and local approaches 


For the sake of clarity, we illustrate which loops are 
identified when a single staple binds to a simple scaffold 
(see Fig.[^, for both the local and global models. Actual 
reaction rates depend on the parameterization discussed 
in Section |II C 5 example calculations are given in Ap¬ 
pendix 


For the global model, changes to each face of the planar 
graph are evaluated at each step. There is a small but 
non-zero cost 


^^shape ^ -h agI;)°p - (agJ.^p -f ag!;’°p) 

( 12 ) 

associated with the transition from the state in Fig.|^a to 
Fig.0b, in which the binding of the first arm of a staple 
alters the physical properties of existing loops, but does 
not create a new loop. Note that AGJ)’°p does not appear 
in Eq.[l^ since that loop is unchanged. This AG^^^'P® is 
manifest in a changed off-rate (Eq.[^. Eor the transition 
from Fig. Sb to Fig.j^c, the graph topology changes and 

^(^shape =agJ(’°p -1- AG^°°P -h AG^°°P 

- (AG|(’°p -f AGy°P). (13) 


This AG^^'^^P® is manifest in a changed on-rate for the 
second arm (Eq.[^. 

For the local model, AG®*'*^p® is zero for the transition 
between the states shown in Fig.|^a and Fig.|^b, and is 
equal to min(AG^°°P, A^°°p) for the binding of the sec¬ 
ond arm (Fig. 0b to Fig.j^c), when a new loop is formed. 

Although in principle the two models seem quite dif¬ 
ferent, in practice the numerical values for AG^^'^^p® are 
quite similar, which we explain as follows. Firstly, the 
change in AG*°°p due to a constituent domain becoming 
doubl e stranded is generally not that large (see Section 
lie5 a nd App endix [C|). Secondly, as will be discussed in 


Section IIC 5 AG^°°p is logarithmic in loop length. Thus 
when a loop is split into two (as in Fig. 0b to Fig.|^c), cre¬ 
ating a larger and a smaller daughter loop, the contribu¬ 
tion to AG^^^'P® from the smaller daughter is much larger 
than the difference in AG^^'^p® between the original loop 
and the larger daughter. Therefore simply considering 
the smallest loop that forms is quantitatively reasonable. 


We note, however, that although numerical values of 
AG®^ape a^]-g similar, there is a fundamental difference 
between the two approaches. Consider two staples that 
can bind to an otherwise empty scaffold (Fig. [^. There 




FIG. 4. Loops present in three configurations of a simple 
origami. Loop a comprises the entire scaffold. Additional 
loops are created by the connections between scaffold domains 
formed by staples. Primed labels indicate loop costs that have 
changed as a result of the conversion of a constituent domain 
from single-stranded to double-stranded DNA. Topological 
changes are indicated by changes in the labels themselves. 



FIG. 5. Thermodynamic inconsistency in the local model: 
the route A —>■ B —>■ D produces a different change in free 
energy from A —>■ G —>■ B. 


are four fully-bound states: 

SA = (00,00), SB = (00,11), 

sc = (11,00),SB = (11,11). (14) 

In the global model, each state is assigned a well-defined 
free energy, including the shape contribution AG®’'®'P®: 
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the free energy is necessarily a function of state and 
changes in free energy are independent of pathway. How¬ 
ever, this is not the case in the local model. Fig. [^indi¬ 
cates free-energy changes associated with two paths from 
state A to state D. If the system were rigorously thermo¬ 
dynamically well-defined, as in the global model, then 
the following equality would necessarily hold: 


AGfTZ + ( 15 ) 


It is clear that the sets of ‘minimal’ loops used in the 
local model to calculate the changes in free energy along 
the two pathways are different, so Eq. [^will not hold in 
general in the local model. The local model is therefore 
not thermodynamically self-consistent. 

Although the local approach does not lead to a well- 
defined G^hape configuration, it is computation¬ 

ally less demanding than the global version and has the 
advantage of supporting non-planar origami designs. In 
this work we will compare the two approaches to explore 
whether the local model can reasonably be used to study 
more complex systems for which the global approach 
might be impractical. The representation of origami as 


a graph is discussed further in Section IIP 1 


5. Estimating loop free energy 

Both the local and global approaches necessitate the 
calculation of a loop free-energy cost In this sec¬ 

tion, we outline a simple estimate of AG*°°p, and identify 
the key variables that lead to cooperativity. 

Our approach is analogous to that of Jacobson and 
Stockmayer m- We approximate the free-energy cost of 
loop formation by 

AGioop ^ In . (16) 

Here, is the probability that the DNA which consti¬ 
tutes a loop associated with a particular crossover spon¬ 
taneously adopts a conformation in which its ends are 
within an arbitrary small distance r,. of each other with¬ 
out being constrained to be there by the crossover. P'^a 
is the probability that two unconnected molecules would 
be within r^. in a hypothetical ideal system of volume 
where 

= P X M-‘ (17) 

Na = 6.022 • 10^^ X mor^ (18) 

In the expression for AG^°°p, arises to correct for the 
difference between a free energy of association of sepa¬ 
rate strands under standard molar conditions (which is 
equal to the hypothetical free energy of association of 
two isolated strands in a box of volume [45]), and the 
free energy of forming a loop. AG'^’^’p thus quantifies the 
effective concentration of one loop end in the vicinity of 


the other, relative to standard conditions. For a further 
justification of this approach, see Appendix [Pj 


For a loop that is formed in the transition sqi sii 
in the local model, it follows from Eqs. [^ 11 and 16 that 


a(soi, sn) = k+ exp {-AG^^^^JRT) x M, 
= fc+exp(-AG|^°P/Pr) X M, 


= fc-i 


1 X M, 


\A'Kr^NA/i ) 


(19) 


It is clear from comparison with the binding rate of the 
first arm of a staple that exp AG|((°(’/i?r^ x M is an 
effective concentration that determines the loop closure 
rate. Calculation of AGlOf^J* in the local model then re¬ 
duces to estimation of P^^^p, the maximum spontaneous 
looping probability for a cycle that contains the edge of 
the staple in question. 

As a first approximation to P[^^p, we treat the DNA 
that forms the loop as a freely-jointed chain consist¬ 
ing of two distinct segment types, duplexes and single- 
stranded DNA. Let the end-to-end distance of the chain 
be given by r, denote P{r) its probability distribution, 
then Pjoop = /g'’ P{r)dr is the probability that the ends 
of the loop are within r^. For a chain with m distinct seg¬ 
ment types, in the limit of a large number of segments. 


P{r) = dTrr^ 


\2Tr E[r 


f _3r.2 \ 


( 20 ) 


where E[r^] = J2i<m mean squared distance 

between the two ends. Here, Ni is the number of seg¬ 
ments of type i with Kuhn length bi. The result for m = 1 
is a classic result of statistical physics [46l |47] . The result 
for m > 1 can be understood from the following simple 
argument. We expect a Gaussian distribution over A, Y 
and Z components of the end-to-end vector for a large 
number of segments (from the central limit theorem) and 
the expression for i5[A^] = E\Y'^\ = E[Z^\ = is 

trivial to derive for an ideal polymer. We also expect 
E[X] = E\Y] = E[Z] = 0, and no correlation between 
A, Y and Z [e[XY] = E[XZ] = E[YZ] = 0) from sym¬ 
metry. Only one Gaussian over A, F, Z satisfies these 
conditions, and it reduces to Eq. when expressed in 
polar coordinates and angular degrees of freedom are in¬ 
tegrated over. 


Substituting the integral of Eq. 20 into Eq. 16 


we 











obtain the following expression for AG'*°°p 


AG-ioop = ij, {p:s/pl^) 

f 3 

fo^ 4^"' ' exp 


= -i?T In 




l^2 E[r'2]ioop^ 



« -i?T In 

= -i?T In 


/ /J~‘^ Anr^dr / 3 

y ^itt^Na \2TrE[r‘^]iaop) j 

(±( _?_ x'"] 

\Na \2TTE[r2]i,,pJ i 


( 21 ) 




FIG. 6 . Bulge formation in a DNA duplex: two long duplex 
sections enclose a single-stranded loop. Bulge formation is 
a special case of loop formation within the context of DNA 
origami. 


We have assumed <C i?[r^]ioop. Thus, in the local 
model, if a transition involves the binding of the second 
domain of a staple, and is the minimal value of 

Elr"^] found for a loop containing that staple. 

Using Eq. [^we can write 

AG'°°P = -i?r 7 ln-^ (23) 

Jmin 

with 7 = 3/2 and C = 3/27r (l x M~^/Na)"^^^ = 6.7 x 
10“^® m^. We express AG*°°p in this way as it allows us 
to identify parameters that can be used to generalize our 
description. 7 = 3/2 is the well-known loop exponent of 
a freely-jointed chain [371 EH]. It gives the scaling of the 
typical volume accessible to the end of a polymer with 
the polymer’s contour length (and is therefore directly 
connected to the looping probability). It is well known 
that excluded volume interactions tend to swell a poly¬ 
mer chain jlH], with the result that an effective 7 > 3/2 is 
obtained. Theoretical estimates predict 7 ^ 1.75 [48] for 
a self-avoiding walk. The widely-used SantaLucia model, 
however, uses a value as high as 7 = 2.44 [33] in an 
equivalent calculation for the single-stranded bulge loop 
illustrated in Fig. [^ this estimate is based on a fit to 
DNA loop closure kinetics |49j . 

7 and G play very different roles in AG*°°p. Increasing 
7 at fixed G exaggerates the differences in the loop clo¬ 
sure penalty AG'°°p between longer and shorter loops, 
whilst also making all loops less stable. Increasing G at 
fixed 7 makes all loops more stable by a constant fac¬ 
tor. In this work, we explore the properties of our model 
as 7 and G are modulated. In particular, we consider 
the consequences of varying G at fixed 7 , and varying 7 
whilst adjusting G so that AG*°°p for a 18-base single- 
stranded bulge loop is fixed at the value obtained in the 
freely-jointed case. We couple changes in 7 and G in 
this fashion because changing 7 at fixed G quickly re¬ 
sults in unreasonable values of AG'°°p. The choice of a 
loop length of 18 for calibration is somewhat arbitrary. 



FIG. 7. Gomparison between the free-energy costs of form¬ 
ing a single-stranded bulge loop for different values of the 
parameter 7 at T = 60°G. A prefactor C-y is adjusted such 
that AG‘°°p for an 18-base bulge is held constant. C 1.5 = 
6.7 X 10“^® m^ C2.5 = 2.8 x 10“^® m^ G3.5 = 5.2 x 10"^® m^. 


although it is small enough to be within the range previ¬ 
ously tested [29], whilst not being so small that the un¬ 
derlying polymer physics approximations become patho¬ 
logical. The magnitude of AG*°°p is therefore likely to 
be physically reasonable at this point. As an illustra¬ 
tion, we plot AG'°°p for a purely single-stranded loop as 
a function of length for 7 = 1.5, 2.5 and 3.5 in Fig. 

The mean-squared end-to-end separation of a heteroge¬ 
neous loop, E[r^], is estimated as follows. An x-base-pair 
double-stranded domain is treated as a single segment of 
length xlds, equal to the length of the helix calculated 
using a contour length of Ids = 0.34 nm per base pair 
m- We treat an x-nucleotide single-stranded domain 
as consisting of xlss/^ss segments of length Ass, where 
Ass = 1-8 nm is the Kuhn length of single-stranded DNA, 
and we assume a contour length for the single-stranded 
section of Igs = 0.6 nm per nucleotide. These values 
are roughly consistent with experimental measurements, 
although the mechanical properties of single strands re¬ 
main somewhat controversial ISIH55]. Where two scaffold 
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domains are held together by a staple, we represent the 
link by a segment of length Xss- The smallest possible 
loop is formed by a “seam” m, when two staples con¬ 
nect two pairs of adjacent scaffold domains (for example, 
the pairs of horizontal black staples in Fig. [^. In our 
model this corresponds to E[r'^] = 2Xss- This loop cost, 
combined with duplex initiation terms [29j . is enough to 
make the binding of a second seam staple less favourable 
than a contiguous 32-base-pair domain within our model 
(at the relevant temperatures for origami assembly), even 
with coaxial stacking strength n = 2. 

We emphasize that the model for AG'^°°p used here 
is extremely basic and should not be over-interpreted. 
It captures the physics that a staple whose binding sites 
are connected by a short, flexible loop is more stable than 
a staple that closes a longer loop. The model contains 
physically meaningful parameters that can be adjusted 
to modulate this effect. It allows us to establish a gen¬ 
eral framework for modelling that could incorporate op¬ 
timized estimates of AG''°°p in the future. It is thus an 
excellent tool for our purposes in this article. 

In Ref. [3T], we used a simplified version of the local 
model, ignoring sequence-dependent hybridization ener¬ 
gies and coaxial stacking, and taking 7 = 1.5. Despite 
these simplifications, the approach taken in Ref. [31] re¬ 
tains the following key features that underlie the cen¬ 
tral results: staple insertion rates depend on previously- 
inserted staples that bring staple binding domains into 
closer proximity, so staple binding is cooperative; staples 
that span large scaffold loops are less stable than equiva¬ 
lent short-range staples; and strong cooperative interac¬ 
tions between pairs of such staples provide a compensat¬ 
ing stabilization of these long-range connections. 


D. Simulation methods 


1. Algorithm outline 

The state space S, rate matrix a{T(t)) and initial state 
s(t = 0) form an inhomogeneous continuous-time Markov 
chain (CTMC) {S,a{T{t)),s{t = 0)), where T{t) is the 
externally imposed function relating time and tempera¬ 
ture that cycles between the initial temperature Tstart, 
the final temperature Tend and then back to the initial 
temperature with a fixed rate \dT/dt\. Instead of sim¬ 
ulating the inhomogeneous CTMC directly, we approx¬ 
imate T{t) and hence the rate matrix a{T{t)) as piece- 
wise constant across 1 second intervals, which is a rea¬ 
sonable approximation for typical experimental cooling 
rates. Individual traces are then generated by applying 
the standard Gillespie simulation algorithm [SHI at each 
interval. To implement the Gillespie algorithm, it is nec¬ 
essary to calculate all transition rates ct(s, s') from the 
current state s to alternative states s'. a{s,s') will be 
non-zero in the following cases: 

1 All unbound domains in s can hybridize with com¬ 


plementary domains of staples present in solution, 
with a rate given by Eq. 

2 Domains of half-bound staples can unbind, with a 
rate given by Eq. 

3 Domains of fully-bound staples can unbind, with a 
rate given by Eq. 

4 Half-bound staples can become fully-bound if the 
opposing domain is free, with a rate given by Eq. 

m 

The term required in transition types 2 

and 3, is computed straightforwardly using the nearest- 
neighbour model of SantaLucia et al. |29| . The term 

^(T-O.stack 

is also needed in transition types 2 and 3 and is 
computed using a simple lookup on the status of neigh¬ 
bouring domains. The term /^co.^hape jjj tran¬ 

sition types 2 and 4, and this is where subtleties arise 
and the global and local models differ. In both models, 
graphs representing states s and s' are needed to calcu¬ 
late As the simulation transitions from state 

to state, the graph is updated. In the global model, a spe¬ 
cific planar embedding of the graph representing state s, 
as depicted in Fig. is used. This specific embedding is 
not required in the local model. 

The graph H(s) = (V, E(s)) itself is defined as follows: 
each junction between domains on the scaffold is a vertex 
V G y and each domain is an edge e S E(s) between the 
appropriate vertices. Fully-bound staples present in state 
s define additional edges between the two vertices that 
are linked by the staple crossovers. A labelling function 

L : E(s) —>■ {single stranded, double stranded, crossover} 

(24) 

assigns the status of each edge, which also has a fixed 
length (number of nucleotides or base pairs) if it is a 
scaffold domain rather than a crossover. Each edge e G 
E(s) is weighted as follows: 

{ {xlds)^ if L(e) = double stranded of length x, 
xlggXss if L(e) = single stranded of length x, 
Xl^ if L(e) = crossover. 

(25) 


The total weight of any loop (simple cycle) within the 
graph is then Tjr^j, the key quantity in estimating the 
loop cost (Eq. 231. 

In the global model, the graph H(s) is assumed to 
have unique planar embedding given by a set of faces, 
F(s), that are subgraphs of H(s). That is, VFj G F(s) : 
V(Fi) C V, E(Fi) C E(s). Each face represents a loop¬ 
ing constraint. The weight of each face is given as 
W(Fi) = X]eGE(Fi) ^(s)- The shape contribution to the 
free energy is thus 


^shape _ ^shape _ 


-RTj In 

F,eF(^i) 


C 

W(F,) 


(26) 
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where the set of loops in Eq. 10 is substituted with the 
faces in the embedding of H(s). As the simulation pro¬ 
gresses, the faces of the graph are merged (transition 
type 3) or split (transition type 4), and we use a cus¬ 
tom data structure to dynamically update the faces of 
the graph. During a transition, it is only necessary to re¬ 
calculate A[r^]ioop = W(Fi) and AG^°°p for the affected 
loops (faces), which are easy to identify. 

The local model does not use an embedded graph rep¬ 
resentation but does use the same weighted graph H(s). 
Estimating the change of is only necessary for 

full binding of a previously half-bound staple (transi¬ 
tion type 4): in that case we approximate AG°’®^®'P® = 
AG|((°(’, where AGI))))^ is the minimal AG^°°p of any loop 


incorporating the newly formed staple crossover in the 
new state s'. This corresponds to finding the simple cy¬ 
cle including the new crossover that minimizes E[r^]. We 
employ Dijkstra’s shortest path algorithm m to find 
the shortest path in s' between the two vertices that are 
to be connected by the staple crossover (excluding the 
crossover itself). This path is then added to the crossover 
in question to make the shortest loop. Given a state s 
and a half-bound staple p, let vi , V 2 be the vertices that 
are joined by a new edge once p becomes fully bound 
by hybridization to domain e. Let the new graph H' be 
equal to H(s) except that L(e) = double stranded, and 
let D(ui,U 2 ) be the weight of the shortest path between 
vi,V 2 in H' under W. Then 


AGt^^ = -RTj\n-^ 

Jmin 

(27) 

A[r^]min = -F D{vi,V2). 

(28) 


Example loop calculations for both the global and local 
models are given in Appendix The simulation code it¬ 
self is found at https : //github. com/f dannenberg/dna 


2. Computational Tractability 

For the origami studied in this paper, using 7 = 2.5 
and n = 2 and a temperature gradient of 1 . 0 °Cmin“^, 
160 simulated folding and melting trajectories between 
80 °C and 20 °C take 21 minutes (local model) or 30 min¬ 
utes (global model) with 10 parallel threads on worksta¬ 
tion hardware (IntelTMXeon R X5660). All other fac¬ 
tors being equal, simulating a larger structure with more 
domains is more computationally demanding, and the 
number of binding or unbinding events in a given unit 
of physical time should scale linearly with the number 
of domains. At least for the Gillespie algorithm, a lin¬ 
ear increase in the number of transitions gives a linear 
contribution to the scaling of the simulation time. 

Our software simulates our origami tile at acceptable 
speed and we have not attempted to further optimize 
our code, or establish the scaling of the cost per transi¬ 
tion with system size. The frequently-used Ml3 genome 
is approximately three times the length of our scaffold; 


such an increase in size should not render the system in¬ 
tractable. Indeed, we simulated a system of twice the size 
of that considered here in Ref. m, using a variant of the 
local model. We note, however, that other factors may 
be at least as important as system size in determining the 
computational challenge. Reaction rates are sensitive to 
staple concentration and domain hybridization free en¬ 
ergies and faster reactions lead to more transitions. In 
particular, if domains of very different stabilities are part 
of the same staple, the less stable domain would be ex¬ 
pected to bind and unbind many times before reaching a 
temperature at which it is stable, leading to a stiff sim¬ 
ulation. For a large origami with strongly heterogeneous 
domains, therefore, it may be necessary to carefully pro¬ 
file and optimize the computational protocol. Caching 
of generated transition rates, using a hash-table and an 
efficient hashing function (where states are keys and lists 
of transitions are values), was found to have a low hitting 
rate for our simulation, but may significantly benefit the 
mentioned stiff models. Upper bounds on the algorithmic 
complexity of our simulation can be found by considering 
each graph query in isolation. However, our simulation is 
dynamic, where graph structures are updated by insert¬ 
ing or removing one edge at a time, and many queries 
are performed on the same graph: the proposed upper- 
bound would not be indicative of the actual problem, and 
proper theoretical treatment lies outside the scope of this 
work. 


E. Experimental methods 

To explore whether the model gives a reasonable de¬ 
scription of origami assembly, we report experimental 
measurements of annealing and melting for the same 
scaffold and staples that we simulate with the model. 
Origami folding is followed using SYBR® Green, a dye 
whose fluorescence is strongly enhanced when bound to 
duplex DNA. We monitor fluorescence during repeated 
cooling and heating cycles of the solution containing scaf¬ 
fold, staples and SYBR® Green. A scaffold concentra¬ 
tion of 10 nM was used, with each staple present at 20 
nM. During folding, the staple concentration is therefore 
depleted by a factor of two (the model treats it as con¬ 
stant). The likely result is a slightly broader annealing 
transition during the experiment than in the limit of neg¬ 
ligible scaffold concentration. 


III. RESULTS AND DISCUSSION 
A. Initial simulations 

We first display the behaviour of the global model us¬ 
ing 7 = 1.5 and without coaxial stacking stabilization 
(n = 0). To do this, we simulate the folding of a simple 
origami as illustrated in Fig. This origami contains 
166 domains, each of 15 or 16 base pairs (apart from two 
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FIG. 8 . Staple binding as a function of temperature during cooling and heating of a simple origami in the global model. In 
column i we show the average number of scaffold domains that are occupied as a function of temperature. Curves are averages 
over 160 cycles with a temperature ramp of 1 °C per min. Also shown is the probability of two-domain binding for a specific 
staple X which is highlighted in the schematic shown in column ii. In column ii we indicate the incorporation temperature of 
each staple (the highest temperature for which the probability of full binding is > 50% during annealing). Staples with a low 
incorporation temperature < 56°C are coloured dark blue, (a) Simulation using different (sequence-specific) duplex binding 
energies for each domain, 7 = 1.5 and coaxial stacking strength n = 0. (b) As (a), except that all 15/16 base-pair domains are 
taken to have duplex equal to the average for a 16 base-pair domain, (c) As (a) but using 7 = 2.5 and coaxial stacking 
strength n = 2. 


long domains of 32 base pairs); the sequences of scaffold 
and staples are equal to that of the ‘monomer tile’ in Ref. 
m- All staples are at a concentration of 20 nM, and the 
scaffold strands are assumed to be dilute enough that 
reduction of staple concentration during folding can be 


neglected. We simulate 160 independent folding trajecto¬ 
ries using the Monte Carlo algorithm outlined in Section 
|TTdT] In each case the system temperature is reduced 
by 1°C min“^ from 85°C to 25°C, at which point the 
temperature is cycled back to 85 °C at the same rate. 
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FIG. 9. Comparison of an experimental measurement of 
origami folding, as reported by SYBR® Green fluorescence I, 
and model behaviour, (a) —dl/dT is reported against T dur¬ 
ing cooling (red, solid) and heating (blue, dashed) between 
80 °C and 40 °C at a rate of 1°C min“^, for a single experi¬ 
mental system of 10 nM scaffold, 20 nM of each staple and 1 x 
SYBR® Green. I is a proxy for the degree of hybridisation. In 
(b) and (c) we plot —dP/dT, where P is the average number 
of scaffold domains occupied, obtained with the global model 
under the same conditions, averaged over 160 realizations, (b) 
n = 0, 7 = 1.5. (c) n = 2, 7 = 2.5. Plotting —dl/dT rather 
than I makes comparison easier without adjusting for baseline 
effects. All curves are smoothed to improve visibility: experi¬ 
mental data is averaged over a 72 s window, corresponding to 
a temperature change of 1.2 °C, and simulation data over a 
window of 0.3 °C. 


Fig.|( a) shows the average fraction of domains that 
are bound as a function of temperature during both an¬ 
nealing and melting, together with the degree of incor¬ 
poration of a single typical staple (staple X). Origami 
assembly occurs at Ta « 65 °C. This midpoint of the an¬ 
nealing transition is approximately consistent with that 


observed in the equivalent experimental system (Fig. |^. 
The model shows very little hysteresis: annealing and 
melting curves nearly overlap, despite the rapid rate of 
cooling. Hysteresis is a generic feature of origami sys¬ 
tems naniiiT], and significant hysteresis is also seen 
in our experimental measurements (Fig. |^. It is clear 
that the basic model with 7 = 1.5 and without coaxial 
stacking fails to capture this effect. 

The simulated melting and annealing transitions are 
fairly broad: the transition from 20 % of domains bound 
to 80% of domains bound during annealing occurs over a 
temperature range of ATa « 9°C. Transition widths in¬ 
ferred from experimental data (particularly in the case of 
melting) are clearly sharper than predicted by the model, 
as shown in Fig. 

Interestingly, the widths of the simulated folding and 
unfolding transitions are not reflected in the correspond¬ 
ing curves for individual staples. For example, staple 
X goes from 20% to 80% bound over a temperature 
range of AT^ « 2.5°C. Rather, this width represents 
the range of incorporation temperatures for individual 
staples; some staples are more stable than others due 
to differences in sequence and between the loops that 
they must enclose. These differences are evident in the 
heatmap of incorporation temperatures in Fig. §(a.ii)). 
In Fig. [^(b), we consider a system in which all do¬ 
mains (except the two longer domains of 32 base pairs) 
are assigned the same AG'°'^“p^®’'(T). ATa is drastically 
reduced, and is not much larger than AT^ if outlying 
single-domain staples are excluded. The narrow range of 
incorporation temperatures for all two-domain staples in 
this case is very clear from Fig.|^(b.ii). 

B. Exploring parameter space 

We now explore the effects of varying model parame¬ 
ters. When specifying the model in Section [llj the key 
quantities that were left as explicitly variable were loop 
parameters 7 and C (see Eq.[^, and the strength of 
coaxial stacking AG^^’^^{T) = n{AG'^P{T)). Table | 
shows the variation in the annealing temperature, hys¬ 
teresis, and annealing transition width with 7 and n. 
Note that C is varied with 7 to ensure a constant cost 
for a 18-base bulge as discussed in Section E 

The following trends are clear. 

1 . Increased coaxial stacking leads to higher melting 
and annealing temperatures, sharper transitions 
and increased hysteresis. 

2. Increased 7 leads to lower annealing temperatures 
and increased hysteresis, but has only a weak effect 
on transition widths. 

Fig. §( c) shows a combination of these effects for 7 = 2.5 
and n = 2. 

The consequences of coaxial stacking are easiest to un¬ 
derstand. Coaxial stacking stabilizes the origami struc¬ 
ture, and it is therefore unsurprising that it increases T^. 
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Tare 


Coaxial stacking 

n 

Global model - 7 
1.5 2.5 3.5 

Local model - 7 
1.5 2.5 3.5 

0 

64.6 

63.4 

61.9 

64.6 

63.2 

61.7 

1.0 

66.6 

65.2 

63.8 

66.4 

65.1 

63.8 

2.0 

68.0 

66.7 

65.5 

67.9 

66.6 

65.5 

3.0 

69.1 

67.9 

66.7 

69.0 

67.8 

66.7 


a/2 

Ta/°C 

Hysteresis/ °C 

ATa/°C 

0.5 X G 5/2 

65.3 

1.9 

5.0 

1.0 X G 5/2 

66.7 

1.8 

5.1 

2.0 X G 5/2 

68.2 

1.6 

5.2 


TABLE II. Effect of varying on the hysteresis, annealing 
temperatnre and width of the transition curve, using 7 = 5/2 
and coaxial stacking equivalent to 2 bp. 


Hysteresis/ °C 


0 

0.1 

0.4 

0.8 

0.2 

0.4 

0.6 

1.0 

0.6 

1.0 

1.3 

0.7 

0.8 

1.0 

2.0 

1.5 

1.8 

1.9 

1.4 

1.7 

1.7 

3.0 

2.2 

2.5 

2.7 

2.3 

2.5 

2.4 


ATa/°C 


0 

8.7 

8.6 

8.6 

00 

bo 

00 

06 

9.1 

1.0 

6.6 

6.5 

6.5 

6.7 

6.7 

6.8 

2.0 

5.2 

5.1 

5.0 

5.4 

5.3 

5.2 

3.0 

4.4 

4.2 

4.3 

4.4 

4.3 

4.3 


TABLE 1. Model properties as functions of exponent 7 and 
coaxial stacking strength n. Data reported, averaged for all 
simulation runs, are annealing temperature Ta (the first point 
at which a moving average of at least 50% of domains are hy¬ 
bridized during annealing), hysteresis {Tm — Ta, where Tm 
is an equivalent quantity to Ta defined during melting), and 
transition width ATa (the range of temperatures over which 
domains go from 20 % bound to 80% bound during annealing). 
Data is presented for heating/cooling rates of 1 °C min“^ for 
the origami shown in Fig.[^ Standard errors on the mean, es¬ 
timated from 160 independent cycles, are smaller than 0.1 °C 
for all data reported. 


It is a cooperative interaction, meaning that the binding 
of one staple favours the subsequent binding of another. 
Cooperativity tends to result in narrower transitions, as 
the binding of isolated staples is suppressed relative to 
the formation of well-folded regions. As partially-formed 
regions are relatively disfavoured, cooperativity tends 
to exaggerate hysteresis (the system must pass through 
more substantial free-energy barriers to assemble or melt, 
slowing down kinetics). 

The influence of 7 is more subtle. The net effect of 
increasing 7 is to increase the penalty for loop closure, 
AG''°°P, for longer loops relative to that for shorter loops. 
We adjust C to maintain the penalty for a 18-base ss- 
DNA loop: for the origami folding process studied here, 
the majority of loops are longer than this so the average 
free energy of loops is increased, explaining the drop in 
Ta with increasing 7 . The presence of previously-bound 
staples can substantially reduce the lengths of the loops 
closed by incoming staples, and thus the entropic cost 
of loop closure, and might thus be expected to lead to 
stronger cooperative effects. Consistent with this hy¬ 


pothesis, we see that increased 7 leads to larger hys¬ 
teresis. Transition widths, however, show only a weak 
dependence on 7 . 

Two effects reduce the dependence of transition widths 
on 7 by counteracting the expected narrowing as a con¬ 
sequence of enhanced cooperativity. Firstly, increasing 
7 has the effect of increasing the intrinsic differences in 
stability between staples by increasing the sensitivity of 
the loop-closure penalty to loop length. This tends to 
make the overall annealing transition broader if no other 
effects come into play. Consistent with this, if we assign 
a state-independent AG'*°°p to each two-domain staple 
(by calculating the loop cost for a staple binding to an 
otherwise empty origami, and not updating this value as 
other staples bind), we observe that ATa systematically 
increases with 7 (see Appendix [E| . 

Secondly, there is a competing anti-cooperative effect, 
mediated by loops, that grows with 7 . AG^°°p grows 
with the expected end-to-end separation ^/E[r'^] of the 
DNA that forms the loop. One way to reduce this cost 
is to create a shortcut through binding a two-domain 
staple across the loop (the cooperative effect discussed 
above). Alternative transitions, however, transform a 
scaffold domain within the loop from single-stranded to 
double-stranded DNA without creating a shortcut. Due 
to the increased stiffness of double-stranded DNA, con¬ 
verting a section of ssDNA to dsDNA always increases 
and hence AG^°°p. Increasing 7 can thus make 
some staples less cooperative (or more anti-cooperative), 
reducing the impact of 7 on AT^. We note that the model 
of Arbona et al. reduces the cost of loop formation by 
an amount proportional to the number of duplex base 
pairs within the loop [55] , although the reason for this is 
unclear. 


In Table |TT|we show the results of varying the param¬ 
eter G (Eq. 23) at fixed 7 and coaxial stacking strength 
n. The dominant effect is that annealing temperature 
rises with G; comparatively small changes are seen in 
the degree of hysteresis and the width of the annealing 
transition. This behaviour is reasonable: from Eq.[23| it 
is clear that G gives rise to a loop-independent contribu¬ 
tion to AG^°°P, and hence its dominant effect is to modify 
the stabilities of all two-domain staples in a systematic 
fashion. 


For the rest of this work, we take 7 = 2.5 and n = 2 
as parameter values which match the scale of hysteresis 
and cooperative effects seen in the experimental data of 
Fig .0 Our value for exponent 7 is close to that used by 
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SantaLucia |23]. There is limited data on the stabilizing 
effects of coaxial stacking, but reported values of 
at T = 37 °C and 1 M Na'*’ are about 15% larger in mag¬ 
nitude than the contribution of one base pair [SHI HH] ■ It 
is not clear how this parameter changes as the temper¬ 
ature increases towards 65 °C, the temperature around 
which annealing/melting occurs. Our parameters are not 
fitted or optimized, but rather a reasonable choice that 
allows us to study generic phenomena in origami folding. 
More experimental data would allow refinement of our 
basic description of the contributions to the free energy 
of a part-folded origami, both in terms of the parameters 
7 and n and more generally with regard to the functional 
form of our free energy. 


C. Global and local models 


In Section |II C 3| we introduced two alternative ap¬ 
proaches: the global and local models. Data presented 
hitherto has been for the global model. The local model 
is a simpler but less rigorous alternative, which may prove 
useful in modelling more complex origami structures. To 
use the local model with confidence, however, it is im¬ 
portant to establish how closely it matches the global 
approach. 

Results from both models are presented in Table |T] 
Their predictions are quantitatively similar, and most 
physically relevant trends are reproduced. There are sys¬ 
tematic differences, however. Most noticeably, the local 
model predicts a weaker dependence of hysteresis on 7 
(Appendix [^. As discussed in Section IIC 4 the local 
model cannot capture all staple interactions mediated 
via loops, and so it is perhaps unsurprising that the most 
noticeable differences between the two approaches should 
be manifested in the response to 7 . 


D. Perturbed systems 

We now explore whether the proposed model can cap¬ 
ture the consequences of various perturbations to the sys¬ 
tem. One of the simplest possible perturbations is to 
reduce the concentration of staple strands [27]. In Fig. 


of staples to 2nM, retaining the assumption that these 
staples are in substantial excess over scaffolds (see Fig. 
[^(c) for comparable data for a staple concentration of 
20 nM). The clearest effect is the substantial drop in T^, 
both for the origami a whole and staple X individually; 
by contrast, the melting transition is largely unchanged 
from the higher-concentration case. As a result, hystere¬ 
sis is enhanced: reduced concentrations slow the rate of 
hybridization but not melting. Similar results were re¬ 
ported in an experimental study |27j . 

There is, however, a more subtle effect at play, which 
can be most clearly seen if we revert to 20 nM staple con¬ 
centrations and compare temperature ramps of 10°C per 


10 (a), we show the effects of reducing the concentration 


minute and 0.1 °C per minute as shown in Fig. 10 (b) and 
(c). Comparing the two protocols, we see that, on rapid 
heating, the melting temperature increases by around 
1.3°C, whereas the annealing temperature Ta decreases 
by 5.3 °C. Similar behaviour was observed experimen¬ 
tally by Sobczak et al. [23], leading those authors to 
conclude that “folding rather than unfolding was not in 
equilibrium”. It is difficult to interpret this conclusion, 
however, as the question of whether the system is in equi¬ 
librium involves the balance or otherwise of folding and 
unfolding transitions with each other. 

When a system is driven out of equilibrium by fast tem¬ 
perature ramps, one would generally expect the melting 
temperature to rise and the annealing temperature to fall 
relative to the equilibrium case, as the system responds 
to the external driving with some time delay. Within 
our model we do observe this behaviour, but the shift in 
the annealing transition is far larger. To understand this 
asymmetry, note that a major feature of the model is that 
the rate at which a staple fully binds to the origami is 
only weakly temperature-dependent (and binding rates 
of the first domain are temperature-independent) [34j . 
By contrast, the rate at which a two-domain staple un¬ 
binds is extremely temperature-dependent, increasing by 
approximately an order of magnitude following a 2°C 
increase in temperature (Section [IIB ). This increase in 
melting rate with temperature allows the system to re¬ 
spond more quickly to being raised rapidly above its equi¬ 
librium melting point; by contrast, annealing rates do 
not notably increase with decreasing temperature, and 
so the system responds relatively slowly to being lowered 
rapidly below its equilibrium melting temperature. 

An alternative perturbation is to remove certain sta¬ 
ples from solution; such an experiment was performed by 
Wei et al. m- We explore the effect on the incorpora¬ 
tion of staple X of removing from solution either: (i) all 
neighbouring staples or (ii) equivalent staples on the op¬ 
posite side of the origami (Fig. 111. The effects are clear 
- the absence of staples from the local neighbourhood has 
a substantial influence on the incorporation of staple X, 
whereas more distant staples have very little effect. Local 
cooperativity, which was also observed by Wei et al. 
arises naturally in a model in which interactions between 
staples are mediated through loops and coaxial stacking. 
Our model predicts that hysteresis exhibited by staple 
X is strongly suppressed if its neighbouring staples are 
removed, reducing the strength of cooperative effects. 


IV. SUMMARY AND CONCLUSIONS 

We have presented a domain-level model of DNA 
origami folding and explored its properties. Our mod¬ 
elling framework is explicitly kinetic and naturally allows 
for hysteresis, local cooperativity and the natural vari¬ 
ability of assembly pathways during origami folding. We 
define a thermodynamically self-consistent model (our 
‘global’ model) that is restricted to planar origami de- 
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FIG. 10. Simulations of the global model for different staple concentrations and heating/cooling rates. In all cases, we use 
7 = 2.5 and coaxial stacking strength n = 2. In column i we show the average number of scaffold domains that are occupied 
as a function of temperature. Curves are averages over 160 cycles with a temperature ramp of 1°C per min. Also shown is 
the probability of two-domain binding for a specific staple X highlighted in the schematic shown in column ii. In column ii we 
indicate the incorporation temperature of each staple (the highest temperature for which the probability of full binding is > 50% 
during annealing). Staples with a low incorporation temperature < 56°C are coloured dark blue, (a) Staple concentration of 
2 nM, cooling rate of 1 °C min“'^. (b) Staple concentration of 20 nM, cooling rate of 10 °C min“^. (c) Staple concentration of 
20 nM, cooling rate of 0.1 °C min“^. 


signs. We also define a ‘local’ model of origami fold¬ 
ing which is based on the same physical principles but 
less thermodynamically rigorous. The local model is eas¬ 
ier to simulate and can be used for non-planar designs. 
We demonstrate that the two approaches give similar re¬ 
sults for a simple origami, with small and understandable 


quantitative differences. 

Within a general framework for origami modelling, we 
consider a specific form for the change in free energy as¬ 
sociated with staple binding. Duplex binding free en¬ 
ergies are taken from the SantaLucia parameterization 
of the nearest-neighbour model of DNA thermodynamics 
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FIG. 11. Staple insertion in the presence of defects, (a) The probability that staple X, highlighted, is fully bound to the 
scaffold is plotted as a function of temperature for simulations in which certain staples are removed from the system (global 
model ,7 = 2.5, n = 2). Curves are averages over 160 cycles with a temperature ramp of 1 °C min“^. Removing distant staples, 
coloured blue in (b), does not significantly change the melting/annealing curves, but removing staples around the marked staple 
(coloured red in (b)) does reduce the incorporation temperature significantly. 


[29| . with an additional contribution from coaxial stack¬ 
ing of adjacent domains. The free-energy cost of con¬ 
straining the origami scaffold to form a loop is calcu¬ 
lated as AG'^°°P = —i?T 7 ln ; where Ni 

is the number of Kuhn lengths of domain type i (ssDNA 
or dsDNA) present in the loop, bi is the Kuhn length 
of domain type i and 7 and C are constants. This de¬ 
scription is very simple, and is based on approximations 
that will break down under certain circumstances (for 
example, the original freely-jointed chain derivation that 
gives 7 = 3/2 is based on the assumption that there are 
many Kuhn lengths in the loop). Nonetheless, AG^°°p 
captures an important contribution to the cooperativity 
of origami folding. We show that larger values of the 
loop exponent 7 give stronger interactions between sta¬ 
ples. Interestingly, we find that the formation of a duplex 
domain within a loop by hybridization of a single staple 
domain can increase the free-energy penalty for closing 
the loop. 

We have demonstrated that parameters of the model 
can be chosen to give results consistent with experimen¬ 
tal data (7 = 2.5 and n = 2). Hysteresis in the model is 
asymmetric (annealing curves are shifted further from the 
equilibrium transition temperature than melting curves) 
and a reduction of the concentration of staples primarily 
influences annealing rates: both observations are consis¬ 
tent with experimental evidence [231 Ei]. Origami fold¬ 
ing within the model is naturally locally cooperative: the 
omission of selected staples has almost no effect on the 
insertion of distant staples, as observed experimentally 
[27] . This is consistent with the observation that distinct 
sections of origami can fold independently at different 
temperatures [53]. These physically reasonable findings 
serve to validate the general framework of the model. 

This domain-level model is extremely simple and has 
considerable potential for improvement. Major simpli¬ 
fications include the lack of an explicit representation 
of the geometry of the origami. This could have sev¬ 
eral effects. For example, when an arm of a staple de¬ 


taches from an otherwise intact origami, the resultant 
single-stranded scaffold domain will be held in place by 
the remainder of the origami, reducing its entropy and 
favouring reformation of the domain in a manner that is 
only partly captured by our treatment of scaffold loops. 
More generally, the greater geometrical order imposed 
by staple binding may contribute to cooperative effects 
that are not well described by the model. The calcula¬ 
tion of AG'°°P is very approximate, and fitting to more 
detailed data (as was attempted by Arbona et al. [26] 1 
for specific loops may give better quantitative modelling 
- although the functional form as it stands is instructive 
in elucidating the physical effects which are important 
in staple-staple interactions. We note that the use of 
7 = 2.5, coaxial stacking strength n = 2 may be com¬ 
pensating for physical effects that are neglected in the 
model, such as cooperativity mediated by overall scaffold 
geometry (Arbona et al. propose a non-specific attrac¬ 
tive interaction between helices [53], which could also be 
incorporated). 

Extending the model to allow interactions other than 
hybridization between fully complementary staple and 
scaffold domains would be desirable. Such interactions 
include partial hybridization of staples to off-target scaf¬ 
fold domains and scaffold secondary structure. This may 
prove challenging as such misbonding interactions do not 
naturally respect the abstraction of DNA hybridization 
at the level of domains. Off-target staple-scaffold inter¬ 
actions would have to be remarkably strong to cause sig¬ 
nificant effects as in the critical temperature range over 
which folding occurs even the binding of a fully com¬ 
plementary staple domain is transient (an origami with 
a wider range of scaffold-staple binding strengths may 
be more prone to staple misbonding). The high tem¬ 
peratures of origami assembly should also limit the con¬ 
sequences of scaffold secondary structure. Indeed, the 
MFOLD software [44] predicts that at 65 °C (a typical 
origami assembly temperature) there is only one signif¬ 
icant secondary structure motif in the scaffold that we 
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use, a 15 bp hairpin containing a single mismatched base 
pair. 

Future work will also focus on improving the param¬ 
eterization of free energies, and exploring the physical 
consequences of staples with more than two domains or 
with domains of significantly different lengths. Impor¬ 
tant open questions include how and why 3D origami 
folding differs from 2D origami. It remains to be seen 
whether observed differences, e.g. stronger hysteresis for 
3D structures, can be explained with models of the kind 
proposed here. 
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Appendix A: Simulating staples with more than two 
domains 


1. State space 

It is trivial to extend the state space of the model to 
consider staples with more than two domains. For ex¬ 
ample, for sections of a scaffold that can bind to three 
domains, we have the following states p (and associated 
permutations): 

• 000: no staple bound to any scaffold domain. 

• 100: a single staple is bound to the first domain, 
the second and third domains are empty. 

• 110: a single staple is bound to the first two do¬ 
mains, the third is empty. 

• 101: a single staple is bound to the first and third 
domains, the second is empty. 

• 120: distinct staples are bound to the first and sec¬ 
ond domains, the third is empty. 

• 111: a single staple is bound to all three domains. 

• 112: a single staple is bound to the first two do¬ 
mains, a second, distinct staple is bound to the 
third domain. 


• 123: distinct staples are bound to all three do¬ 
mains. 


2. Kinetics of multi-domain staples 

In principle, kinetic models can be constructed in a 


(Eqs. and [^. Binding of secondary or ter¬ 

tiary domains of a staple introduces looping constraints, 
as before, and the rates can be defined to take these 
constraints into account. Duplex and coaxial stacking 
free energies can be calculated as before - we discuss 
additional subtleties associated with the calculation of 
AG^^ape^ and important topological issues, below. 


very similar manner to that described in Section IIB 


3. ^(^ahape global modcl 


A two-dimensional origami with multi-domain staples 
can generally be represented as a planar graph (unless 
parallel crossovers are used). Each staple crossover can 
be labelled as internal or external based on the intended 
design. However, intermediate states in which a staple 
binds by non-adjacent domains (such as p = 101) are, 
in general, not planar due to the edge that contains the 
unbound central staple domain (Fig. 12). One solution to 
this problem would be to forbid states in which a staple 
binds to non-consecutive domains, such as p = 101. Such 
an approach is taken by Arbona et al. to simplify their 
model [26]. An alternative, less drastic approach might 
be the following. 

• First, consider only crossovers that link consecu¬ 
tive bound domains of staples (green staples in 
Fig. 12 3), and identify the faces and loops based 


on the resultant graph H. 

• Then consider in turn each staple that is bound by 
non-consecutive domains (red staple in Fig.[T^), 
assi gning lo ops based on the smallest cycle (see Sec¬ 
tion IID 1) involving a graph constructed using H 


and the staple-link in question. 

The result is a thermodynamically well-defined model, 
in that each bonding configuration can be assigned an 
unambiguous free energy. Note that links formed by 



FIG. 12. Three-domain staple in fully bound (A) and semi¬ 
bound (B) states. In the case of state B, a planar embedding 
is not possible (the arcs of the graph cannot be drawn in 2D 
without them crossing); the global model has to be adapted 
to deal with this situation. 
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the binding of non-consecutive domains of a staple (cf. 
Fig. 12 3) are not considered when computing the looping 
constraints in H. 


4. ^(^shape local model 

As the local model does not require a planar graph, 
estimates of AG'°°p can be made directly from a full 
non-planar graph involving all staples. 


due to the external face between any two intermedi¬ 
ate states s and s' that can interconvert is therefore 
always very small. Given the approximate nature 
of the model, these small differences are not signif¬ 
icant. 

A possible alternative approach would be to ignore the 
large external loop in calculating - complicating 

the model in this way seems unnecessary at this stage, 
however. 


5. Topological considerations 


Appendix C: Examples of loop and rate calculations 


Multi-domain staples introduce topological assembly 
problems. For example, the direct transition p = 101 —>■ 
p = 111 (Fig. 12 3 to Fig.[l2]A.) is impossible, as the cen¬ 
tral staple domain and the scaffold cannot twist round 
each other to form a double helix whilst maintaining 
contact through the exterior domains. The model would 
have to be modified to take account of such effects. Topo¬ 
logical effects should, however, be describable through 
effective modulations of a domain-level model - for ex¬ 
ample, to forbid topologically impossible transitions. 



Appendix B: The number of faces (loops) in the 
global model 

As noted in Section [Tl C 3[ the graph corresponding to 
an origami without staples has two faces (one internal, 
one external). For every two-domain staple that is sub¬ 
sequently added, one more face is formed. Thus, for an 
origami with m fully-bound two-domain staples, we have 
m -I- 2 faces. 

In our global model, faces are interpreted as looping 
constraints with an associated AG^°°p. Is it problematic 
that our formalism introduces m + 2 constraints? A cir¬ 
cular origami has, in reality, m -I- 1 looping constraints 
(i.e., an emptly, circular scaffold has a single constraint). 
So the question is whether the single extra looping con¬ 
straint in the global model is a problem. To see that it 
is not, consider the following: 

• whether the total number of loop constraints is Tn+ 
2 or m -I- 1 , the effect of adding an extra staple is 
to create an additional constraint; 


FIG. 13. Graph representation with labelled edges of an 
origami in four partly-folded states (a — d). The faces are 
enumerated in Table mil 

Fig. shows a partly-folded origami in a variety of 
states: we discuss the calculation of the term AG^^'^p® 
for transitions between these states. 

Firstly, in both the global and local models, initial 
binding from solution and unbinding of a second staple 
domain do not require estimation of AG®’'®p®. Transi¬ 
tions cr(b, a) and cr(c, a) fall into these categories. Fol¬ 
lowing Eq.j^and Eq.[^ we find 

cr(c, a) = fc+[staple] (Cl) 



the effect of the large external face on AG® is al 
ways small. In the initial state, the external face of 


and 


staples are added, the loop remains large, although 
in the final state it is somewhat smaller, consist¬ 
ing of 66 duplex domains and 10 staple crossovers 
{E[r^\ « 1986 nm^). Even for 7 = 2.5, AGg^fT^ 
of the large external loop only changes by 0.56 
kcal/mol between the limiting cases of the free scaf¬ 
fold and fully-folded state. The change in AGg^^*)^® 


[Jis associated with 


G] = 2858 nm^). As 

(T(b, a) = exp ^AG° 


(C2) 


in which AG° is the estimated standard free-energy 
change of formation of the duplex in question and [staple] 
is the staple concentration (there are no coaxial stacking 
changes to consider in this case). 

In the global model, all o ther transitions require esti¬ 
mates of AG®*'®'P®. In Table III we identify the relevant 







20 


State Face 

Edges 

Weight 

E[r'^] /nm^ 

a Fi 

ei, 62 , Cl, Cg, eio 

2 X (xlds)‘^ H" (2x/ss/Ass + 1) X Ass 

97.0 

Fa 

^3 5 ^4 5 ^5 7 ^ 6 ; 67 ; Cs) Cl 

1 X (xlds)^ H" (Sx/ss/Ass + 1) X Ass 

119.2 

Fs 

Cl ) ^2 ; C 3 7 7 C 5 7 C 6 7 C 7 7 Cs 7 Cg 7 GlO 

3 X (xlds)‘^ H" (7x/ss/Ass + 0) X Ass 

209.7 

b Fi 

61,62, Cl, eg, 610 

2 X (xlds)^ H" (2xlss/-^ss + 1) X Ass 

97.0 

Fa 

cs, 64 , ca, 67 , eg. Cl 

2 X (xldsf + (2xlas/-\ss + 2) X AL 

100.2 

Fs 

65 , 66, Ca 

0 X (xlds)'^ H" (2x/ss/Ass + 1) X Ass 

37.8 

F 4 

Cl 7 C 2 7 C 3 7 7 C 5 7 C 6 7 Cy 7 Cs 7 Cg 7 Cio 

4 X (xlds)^ H" (Gx/ss/Ass + 0) X Ass 

222.0 

c Fi 

61, 63, Cl, Cg, 610 

2 X (xlds)‘^ H" (2xlss/Ass + 1) X Ass 

97.0 

Fa 

Cs, 64 , 65 , 66 , 67 , eg. Cl 

0 X {xldsY + {&xlss/Xss -1-1) X AL 

106.9 

Fs 

Cl 7 C2 7 C 3 7 C4 7 C 5 7 CQ 7 C 7 7 Cs 7 Cg 7 ClO 

2 X {xlds^ ~\~ (8x/ss/Ass + 0) X Ass 

197.3 

d Fi 

61, 63 , Cl, Cg, 610 

2 X ixlds)'^ ~\~ (2xlss/Ass + 1) X Ass 

97.0 

Fa 

C 3 7 C^ 7 C 5 7 Cg 7 Cy 7 Cs 7 Cl 

2 X {xlds)"^ ~\~ {/^Xlss / Ass + 1) X Ass 

131.5 

Fs 

Cl 7 C 2 7 C 3 7 C/^ 7 C 5 7 Cg 5 Cy 7 Cs 7 Cg 7 ClQ 

4 X {xlds')^ (6x/ss/Ass + 0) X Ass 

222.0 


TABLE III. Weights as given by the global model for the faces of the system depicted in Fig. |13| For simplicity, we take 
all scaffold domains to have the same length of 16 nt, with contour length L = 16 x 0.34 nm in the single-stranded state. 
Double-stranded edges are underlined. 


faces and associated loop costs (a subset of loop costs 
are also relevant to the local model). We recall that loop 
costs are determined by which is in turn given by 

summing over the weights of individual edges in a loop. 
These edges contribute (Eq.[^: 

W(ci) = = (1.8 nm)^ for a crossover; (C3) 

{xldsf = (0.34 nm x xY for x-bp dsDNA 
xlss^ss = 1.08 nm^ x x for a;-nt ssDNA 

(C4) 

These expressions are used in Table |III| to calculate 
for the various faces in the global model. 


using Combining these specihc values with Eq.[^ and 
Eq.[^ we find 

cr(a, c) = 0.6546 • k+ exp (AG°^'^P‘®’^/i?T) x M, (C9) 

(T(d, a) = 0.6782 • k+ exp x M, (CIO) 

cr(a,b) = 1.998-10"^ XX M. (Cll) 

It is clear that the unbinding rate for a half-bound staple 
(transition a —)■ c) depends on the state of the origami, 
while the unbinding rate for a domain of a fully-bound 
staple (c —a) does not. This is a direct result of the 
choice of rate constants recorded in Equations and 
Different approaches are possible without violating ther¬ 
modynamic consistency of the model. 


2. Local model 


1. Global model 

In the global model the shape term is computed as (Eq. 
23) 




(C5) 


For transitions (T(a, c) and (T(d, a), the local model uses 
AG^h^-pe = 0 as no loops are formed during these transi¬ 
tions. Thus 


cr(a,c) = fc+exp (AG°j“P'®=^/i?r) x M, 
cr(d, a) = k+ exp (AG°^P^®’'/i?T) x M. 


(CI2) 


where F(s) are the faces in the embedding of H(s) and 
W(Fi) is the weight of a face. Taking 7 = 2.5 (and G2.5 = 
2.8 X 10“^®m^), we find at T = 37.0 °C 


^gshape ^ 3 g 2 g kcal/mol, (C 6 ) 

AG^hape ^ _o.261 kcal/mol, and (C7) 
^gshape ^ _o _239 kcal/mol. (C 8 ) 


In case of the transition < 7 ( 3 , b), a new loop is formed. 
Thus we must find the cycle in b containing C 2 that min¬ 
imizes E[r^] - this is the loop consisting of edges 

65)66,02. (CI3) 


Using the value for Mr^] for this loop tabulated in (Table 
III), with Eq.[^ Eq.j^and Eq. 23 we find 


cr(a, b) = 1.494 • 10"^ x fc+ x M 


(CM) 
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Appendix D: Justification of the form of 

To justify our treatment of the thermodynamic cost 
of forming a single loop in our model, we consider the 
thermodynamic cycle shown in Fig. We consider the 
binding of two domains to a longer strand, both when 
they are connected to form a single staple and when they 
are separate strands. 






FIG. 14. Thermodynamic cycle. We consider two domains, a 
(red) and b (blue), that can bind to a scaffold. We consider 
the thermodynamics of binding when the two are connected 
and when they are separate. Estimates of the free energies 
of transitions (i), (ii) and (iv) are used to estimate that of 
transition (iii). 

We take the entire system to be contained within an 
arbitrary volume v. The free energy change associated 
with transition (i) is simply the sum of those for the 
formation of two independent duplexes: 

AG(,) = AG° + AGg + 2i?Tln(u/u°), (Dl) 

where the first two terms are standard free energies of 
formation, and the third corrects for the fact that stan¬ 
dard free energies are dehned at 1 M concentration of 
reactants [45] . 

The free energy change associated with transition (ii) 
includes AGbreak, the cost of breaking the DNA back¬ 
bone (independent of v). It also includes the entropy 
gain associated with allowing the two separate halves to 
explore volume v independently which depends on the 
probability, that two unconnected halves sponta¬ 
neously come within an interaction distance, Tc, that is 
comparable to the length of a backbone link: 


r,/°c 


Coaxial stacking 

n 

Local model 

1.5 2.5 

- 7 

3.5 

0 

64.2 

62.7 

61.4 

1.0 

66.0 

64.7 

63.6 

2.0 

67.5 

66.3 

65.2 

3.0 

68.7 

67.5 

66.6 


Hysteresis/ °C 


0 

0.2 

0.2 

0.2 

1.0 

0.5 

0.5 

0.4 

2.0 

0.9 

0.8 

1.0 

3.0 

1.5 

1.3 

1.5 


ATa/°C 


0 

9.5 

9.8 

10.5 

1.0 

7.1 

7.5 

8.0 

2.0 

5.5 

5.8 

6.4 

3.0 1 

4.7 

4.8 

5.5 


TABLE IV. Predictions of the local model when modified 
to remove effects of loop-mediated cooperativity (see text). 
Data is comparable to that presented in Table jlj 

Transition (iv) also involves the breaking of the DNA 
backbone, but this time the freedom gained is not the 
ability to move around the whole of u, but rather the 
freedom for the scaffold to explore conformations that 
were inaccessible when it was constrained to form a loop. 
Thus, instead of we obtain the probability 

that the ends of the two halves connected by the loop 
spontaneously come within when they are not directly 
connected: 

AG(„) = AGbreak + RT ■ (D3) 

By inspection of the thermodynamic cycle: 

= AG(ij) -I- AG(^i) — /S.G(iy)^ (D 4 ) 

and thus 

AG(rrr) = AG0 + AG0 + Prin((A!^y (D5) 

P/''= scales as 1/u, and so we can replace P^‘= with ^P^S 
to cancel one of the factors of v/v^ inside the logarithm. 
Overall, reaction (iii) is a bimolecular association, and 
therefore the free-energy change in a volume v can be 
converted into a standard free energy by adding a term 
RTln(v^/v), cancelling the second factor of u/u° and giv¬ 
ing 


AG(,,) = AGbreak + PTln(P;=), 


(D2) 


AG0,,)=AG° + AG0 + Prin(^). (D6) 
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FIG. 15. Staple binding as a function of temperature during cooling and heating of the simple origami for coaxial stacking 
constant n = 2 and loop exponent 7 = 2.5, staple concentration 20 nM and temperature gradient \dT/dt\ = 1.0 °C min“'^. Full 
lines: global model; dashed lines: local model. Red/blue lines indicates behaviour averaged over all staples; black/grey are 
predictions for a specific staple X (cf. Fig. |10[) - these data for X were smoothed using a cubic spline. 


We can compare with Eq.|^which also describes a change 
of state in which a staple fully binds to the origami. 
AG° + AG° corresponds to the contribution, 

and RT [Plo /to as required. Any 

changes in AG®*®®^ due to the presence of adjacent du¬ 
plex domains could be incorporated into this derivation 
without influencing its outcome. 


Appendix E: Additional data 


1. Effects of changing 7 that are not dependent on 
cooperativity between loops 


In Section III we argue that the increasing 7 would 
tend to increase transition widths in the absence of co¬ 
operative interactions between staple loops. To quantify 
this effect, we simulate a system from which the effects 
of loop-mediated cooperativity have been artificially re¬ 


moved. This change is simple for the local model - we 
calculate AG®’'®p® for the binding of a staple to an oth¬ 
erwise empty origami, and use this value throughout the 
simulation regardless of the state of the origami (we note 
that in this limit, the local and global models are ex¬ 
tremely similar and the local model is also thermody¬ 
namically well-defined). The results presented in Table 
|IV| show that, in the absence of cooperative interactions 
between loops, AT^ increases with 7 as expected. 

2. Comparison of local and global models 


Fig. shows the predictions of both global and lo¬ 
cal models. Melting and annealing curves are plotted 
for coaxial stacking parameter n = 2 , loop exponent 
7 = 2.5, staple concentration 20 nM and temperature 
gradient \dT/dt\ = 1.0°C min“^. The predictions of the 
two models for the average probabilities of staple incor¬ 
poration are very similar, consistent with data presented 
in Table The response for the single staple X shows 
more variation between the two models. 













